翻訳と辞書
Words near each other
・ Sequence (biology)
・ Sequence (disambiguation)
・ Sequence (filmmaking)
・ Sequence (game)
・ Sequence (geology)
・ Sequence (journal)
・ Sequence (medicine)
・ Sequence (music)
・ Sequence (musical form)
・ Sequence (post production)
・ Sequence alignment
・ Sequence analysis
・ Sequence And Batch Language
・ Sequence assembly
・ Sequence breaking
Sequence clustering
・ Sequence container (C++)
・ Sequence dance
・ Sequence database
・ Sequence dating
・ Sequence determination
・ Sequence diagram
・ Sequence feature variant type
・ Sequence Hills
・ Sequence hypothesis
・ Sequence labeling
・ Sequence learning
・ Sequence logo
・ Sequence motif
・ Sequence of events recorder


Dictionary Lists
翻訳と辞書 辞書検索 [ 開発暫定版 ]
スポンサード リンク

Sequence clustering : ウィキペディア英語版
Sequence clustering
In bioinformatics, sequence clustering algorithms attempt to group biological sequences that are somehow related. The sequences can be either of genomic, "transcriptomic" (ESTs) or protein origin.
For proteins, homologous sequences are typically grouped into families. For EST data, clustering is important to group sequences originating from the same gene before the ESTs are assembled to reconstruct the original mRNA.
Some clustering algorithms use single-linkage clustering, constructing a transitive closure of sequences with a similarity over a particular threshold. UCLUST〔(【引用サイトリンク】title=USEARCH )〕 and CD-HIT〔(【引用サイトリンク】title=CD-HIT: a ultra-fast method for clustering protein and nucleotide sequences, with many new applications in next generation sequencing (NGS) data )〕 use a greedy algorithm that identifies a representative sequence for each cluster and assigns a new sequence to that cluster if it is sufficiently similar to the representative; if a sequence is not matched then it becomes the representative sequence for a new cluster. The similarity score is often based on sequence alignment. Sequence clustering is often used to make a non-redundant set of representative sequences.
Sequence clusters are often synonymous with (but not identical to) protein families. Determining a representative tertiary structure for each sequence cluster is the aim of many structural genomics initiatives.
== Sequence clustering algorithms and packages ==

* OrthoFinder:〔(【引用サイトリンク】title=OrthoFinder )〕 a fast, scalable and accurate method for clustering proteins into gene families (orthogroups)
* UCLUST in USEARCH〔
* CD-HIT〔
* nrdb90.pl
* TribeMCL: a method for clustering proteins into related groups
* BAG: a graph theoretic sequence clustering algorithm〔http://bio.informatics.indiana.edu/sunkim/BAG/〕
* JESAM:〔(【引用サイトリンク】title=Bioinformatics Paper: JESAM: CORBA software components for EST alignments and clusters )〕 Open source parallel scalable DNA alignment engine with optional clustering software component
* UICluster:〔http://ratest.eng.uiowa.edu/pubsoft/clustering/〕 Parallel Clustering of EST (Gene) Sequences
* BLASTClust single-linkage clustering with BLAST〔(【引用サイトリンク】title=NCBI News: Spring 2004-BLASTLab )
* (Multi)netclust:〔(【引用サイトリンク】title=WUR Multi-netclust web server )〕 fast and memory-efficient detection of connected clusters in (multi-parametric) data networks
* Clusterer:〔(【引用サイトリンク】title=Clusterer: extendable java application for sequence grouping and cluster analyses )〕 extendable java application for sequence grouping and cluster analyses
* PATDB: a program for rapidly identifying perfect substrings
* nrdb:〔http://web.archive.org/web/20080101032917/http://blast.wustl.edu/pub/nrdb/〕 a program for merging trivially redundant (identical) sequences
* CluSTr:〔http://www.ebi.ac.uk/clustr/〕 A single-linkage protein sequence clustering database from Smith-Waterman sequence similarities; covers over 7 mln sequences including UniProt and IPI
* ICAtools〔(【引用サイトリンク】title=Introduction to the ICAtools )〕 - original (ancient) DNA clustering package with many algorithms useful for artifact discovery or EST clustering
* Virus Orthologous Clusters:〔(【引用サイトリンク】title=VOCS - Viral Bioinformatics Resource Center )〕 A viral protein sequence clustering database; contains all predicted genes from eleven virus families organized into ortholog groups by BLASTP similarity
* Skipredudant EMBOSS tool〔(【引用サイトリンク】title=EMBOSS: skipredundant )〕 to remove redundant sequences from a set



抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)
ウィキペディアで「Sequence clustering」の詳細全文を読む



スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.